Automatic document navigation for digital content remastering

نویسندگان

  • Xiaofan Lin
  • Steven J. Simske
چکیده

digital content re-mastering, document structure analysis, print on demand, content linking, OCR This paper presents a novel method of automatically adding navigation capabilities to re-mastered electronic books. We first analyze the need for a generic and robust system to automatically construct navigation links into re-mastered books. We then introduce the core algorithm based on text matching for building the links. The key features of the system are also described with a discussion of the experimental results on the MIT Press corpus.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supporting Early Document Navigation with Semantic Zooming

Traditional digital document navigation found in Acrobat and HTML document readers performs poorly when compared to paper documents for this task. We investigate and compare two methods for improving navigation when a reader first views a digital document. One technique modifies the traditional scrolling method, combining it with Speed-Dependent Automatic Zooming (SDAZ). We also examine the eff...

متن کامل

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

A Semi-supervised Approach for Improving Search, Navigation and Data Quality in Autonomous Digital Libraries

The current rapid uptake of Autonomous Digital Libraries [56] (both in scholarly and generalist domains) has driven the need for automated procedures for extracting, processing and representing the digital information contained in these digital repositories. Concurrently, the development of Web 2.0 technologies and applications has provided new opportunities and challenges for web-based informa...

متن کامل

Towards Automatic Content Tagging - Enhanced Web Services in Digital Libraries using Lexical Chaining

This paper proposes a web-based application which combines social tagging, enhanced visual representation of a document and the alignment to an open-ended social ontology. More precisely we introduce on the one hand an approach for automatic extraction of document related keywords for indexing and representing document content as an alternative to social tagging. On the other hand a proposal fo...

متن کامل

Towards Rapid Generation and Visualisation of Large 3D Urban Landscapes for Mobile Device Navigation

In this paper a procedural 3D modelling solution for mobile devices is presented based on scripting algorithms allowing for both the automatic and also semi-automatic creation of photorealistic quality virtual urban content. The combination of aerial images, GIS data, 2D ground maps and terrestrial photographs as input data coupled with a user-friendly customized interface permits the automatic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004